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^ (54) Title: MUTATED HIV TAT 

m 

(57) Abstract: The present invention provides a Tat protein wherein all the cysteine residues of the cysteine-rich domain have been 
Q replaced with another amino acid, preferably with serine, nucleic acids encoding it, and methods of using it to elicit a humoral and 
^ cellular immune responses in a mammal. The Tat protein of the invention is therefore useful, inter alia, for prophylactic and/or 
^ therapeutic anti-HW Use as well as raising anti-native Tat antibodies in mammmals. 
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MUTATED HIV TAT 

BACKGROUND OF THE INVENTION 

Field of the Invention 

This invention relates to the field of modified HIV Tat nucleic 
5 acids and proteins as well as its combination with early HIV 
proteins and their use in studying the biological mechanism of HIV 
infection and in vaccine compositions for prophylaxis and treatment 
of HIV/AIDS. 

Summary of the Related Art 

0 HIV Tat protein is an essential viral protein for HIV pathogenesis. 
It transactivates HIV gene expression by binding to the Trans 
Activation Response (TAR) element of the HIV RNA Long Terminal 
Repeat (LTR) region. Tat is released by infected cells in which it 
is expressed (soluble Tat or sTat) and taken up by other HIV 

5 infected cells, where it can enter the nucleus and transactivate HIV 
gene expression. Extracellular Tat induces expression of HIV co- 
receptors on target cells, thereby further promoting virus 
spreading. See generally Noonan et al., Advances in Pharmacology 48, 
229 (20.00) . 

0 Tat also plays a role in HIV-induced immunosuppression. Id. For 
example, Cohen et al., Proc. Natl. Acad. Sci. USA 96, 10842 (1999), 
reported that Tat is strongly immunosuppressive, both immediately 
after immunization of mice with sTat and in seroconverting humans. 
Tat has also been linked to induction of T-cell anergy and T-cell 

5 apoptosis, Ross, Leukemia, 15, 332 (2001). Furthermore, Tosi et al., 
Eur. J. Immunol. 30, 19 (2000), demonstrated that a modified HIV-1 
Tat can act as a immunosuppressor by inhibiting HLA class II 
expression necessary for triggering both cellular and humoral 
responses against pathogens. 

0 The Tat protein is an 86-102 (depending on the HIV strain) amino 
acid protein encoded by two exons. The first, highly conserved exon 
contains four functional domains, including the amino-terminal 
domain (amino acids 1-21) , the cysteine-rich domain (amino acids 22- 
37), the core domain (amino acids 38-48), and the basic domain 

5 (amino acids 49-57), which is essential for cellular uptake. The 
cysteine-rich domain is highly conserved and has been reported as 
being important for the Tat transactivating activity. Individual 
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mutation in six of the seven cysteines eliminates Tat function. 
Jeang in HIV-I Tat: Structure & Function, pp. 3-18, Los Alamos 
National Laboratory (Ed.) Human Retroviruses & AIDS Compendium III. 
Because of its essential role in HIV expression and propagation, Tat 
5 has been suggested and studied as a possible vaccine. Goldstein, 
Nature Medicine, 1, 960 (1996). Cafaro et al., Nature Medicine 5, 
643 (1999) reported that vaccination of cynomolgus monkeys with a 
biologically active HIV-1 Tat protein is safe, elicits a broad 
(humoral and cellular) specific immune response and reduces 
10 infection of the highly pathogenic simian-human immunodeficiency 
virus (SHIV)-89.6P to undetectable levels. 

For human use suppression or inactivation of Tat activity has been 
suggested as a route for prophylaxis and/or treatment of HIV 
infection. E.g., Goldstein, WO 95/31999. Tat protein that has been 
15 modified to reduce or eliminate its transactivating activity while 
maintaining its immunogenicity has been proposed. 

Cohen et al. (supra) reported that oxidation of Tat preserves 
immunogenicity of the protein while inactivating Tat's 
immunosuppressive effects. 

20 Le Buanec and Bizzini., Biomed & Parmacother. 54, 41 (2000), 
reported on chemical inactivation of Tat, e.g., formaldehyde, 
glutaraldehyde, and dithionitrobenzoate treatment as well as 
amidination of lysyl residues, modification of arginyl residues, 
blockade of sulfhydryl groups by dithionitrobenzoate treatment, 

25 maleimidation, carboxymethylation, and carboxyamidation. They found 
that such chemical modification resulted in a Tat protein with a 
partial or complete loss of biological activity but retention of 
partial to complete antigenicity and immunogenicity in mice compared 
to native Tat. Zagury et al. (US 6,200,575) also discloses 

30 formaldehyde and glutaraldehyde inactivation of Tat. 

Another approach has been modification of the protein by mutation. 
Caselli et al. investigated two tat genes mutated in the 
transactivating domain for their ability to elicit an immune 
response to wild-type Tat in a mouse model. The polypeptides encoded 

35 by the two genes, tat 22 (Cys 22 -»Gly) and tat 22 /37 (Cys 22 -»Gly and 
Cys 37 -»Ser) , lack HIV transactivating activity and block wild-type 
Tat. Caselli et al. injected mice with DNA plasmids containing the 
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tat 22 and tat 22 / 37 genes and tested for humoral and cellular response 
to wild-type Tat. A humoral response suggestive of a Thl profile was 
detected after the third immunization, and mean titers and the 
number of responder mice increased following three additional boosts 
5 and treatment with bupivacaine (which facilitates DNA uptake in 
muscle cells and enhances DNA immunization) . The response was 
comparable to DNA immunization with the wild- type tat gene. 
Caselli et al. also immunized mice with wild-type Tat protein and 
observed both humoral and cellular responses. Antibody titers were 
higher in the Tat-immunized mice compared to the tat 22 and tat 22/31 
immunized mice, although epitope reactivities were more restricted 
and a Th-2-like response observed. The authors speculated that the 
differences in DNA and protein immunization response were likely due 
to protein sensitivities to air, light, and temperature and 
differences in presentation of the two to the immune system. Caselli 
et al. asserted that DNA immunization seemed preferable due to the 
presence of a cellular response characteristic of a Thl reaction. 
Zagury, (US6, 200, 575) discloses the use of inactivated Tat and 
various forms of inactivated Tat as immunogens for prophylactic or 
therapeutic immunizations to fight HIV disease. 

Tosi et al., (supra) reported on a tat 22/37 (Cys 22 -»Gly and Cys 37 -»Ser) 
and a tat 37 (Cys 37 ->Ser) mutant, both transfected into T and monocytic 
cell lines. Both mutants were reported to strongly down-modulate 
constitutive as well as IFN-y-inducible HLA class II gene expression 
in vitro, suggesting that these mutants retain the immunosuppressive 
function of the native polypeptide. 

Goldstein, Nature Medicine, supra, suggested that a consensus 
sequences of 21 known HIV-1 Tat proteins could be used as the 
immunogen in a vaccine and further suggested Cys -> Ser substitutions 
could be made at positions 22, 25, 27, and/or 37 to block 
transactivation without affecting the immunogenic domains. 
Goldstein WO 95/31999 suggested inactivation of Tat by deletion at 
the amino or carboxy terminus or deletion or replacement of native 
cysteine residues to interfere with naturally-occurring disulfide 
bonds . 
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Loret (WO 00/61067) discloses Tat protein mutated in the cysteine 
rich region. Most particularly, Loret specifically considers Tat 
OYI, which corresponds to a Tat protein having a natural Cys 22 -»-Ser 
mutation . 

5 Furthermore, Osterhaus et al. has demonstrated the presence of Tat 
and Rev-specific CTL in seropositive long term non progressors 
whereas these CTLs were not found in patients progressing to 
disease. In addition immunization of macaques with a combination of 
vectors expressing the SIV tat and rev genes protected the animals 
10 against pathogenic SIV challenge. Vaccine 17, 27-31, 1999; U.S. 
6024965. 

A recent study by Addo et al. (Proc. Natl. Acad. Sci. USA vol. 98, 
1781-1786) demonstrated that controllers (HIV-1 infected individuals 
capable of controlling viremia without medication) had CTLs 
15 targeting more epitopes in Tat relative to individuals on drug 
treatment. Furthermore, the anti-Tat CTL responses were also of 
higher magnitude in controllers. 

More recently, Allen et al . demonstrated that Tat-specific CTLs are 
involved in controlling wild-type virus replication during SIV 

20 infection of rhesus macaques. Nature 407, Sept 2000, 386-390. 

Despite the tremendous effort that has been dedicated to the study 

of HIV, Tat, early proteins and their role in AIDS, all of the 

molecular biological mechanisms of HIV in general and Tat in 
particular are not completely known or understood. A composition and 

25 method for HIV/AIDS prophylaxis and treatment has also remained 
elusive. Accordingly, there still remains a need for an HIV/AIDS 
vaccine as well as useful research tools to study HIV infection. 
All patents and other publications recited herein are hereby 
incorporated by reference in there entirety. 

30 SUMMARY OF THE INVENTION 

The present invention is based on the discovery that modification of 
HIV Tat protein in the cysteine rich domain by replacing all the 
cysteine residues with other amino acids, preferably serine, results 
in a modified Tat protein that retains its immunogenicity, is unable 
35 to transactivate HIV expression, is not immunosuppressive, and is 
able to induce neutralizing antibodies. The present invention 
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comprises also the simultaneous use of tat, rev and nef genes to 
elicit broad HIV specific T cell responses (including CD4 and CD8 as 
well as innate immunity) . This combination of features makes the 
modified Tat protein of the invention as well as its combination 
with early proteins useful both as a vaccine as well as a research 
tool to study the molecular and systemic mechanisms involved in HIV 
infection. 

The present invention thus provides a Tat protein comprising a 
mutated cysteine-rich domain wherein all the cysteine residues of 
the cysteine-rich domain have been replaced independently with 
another amino acid. 

According to a specific embodiment each cysteine residue of the 
cysteine-rich domain is a conservative substitution and is 
preferably a serine. 

In another aspect, the invention relates to a nucleic acid encoding 
the Tat protein as defined above as well as an expression vector 
comprising said nucleic acid. In alternative embodiments, the said 
vector further comprises a DNA sequence encoding Nef and Rev 
proteins. According to a preferred embodiment, the DNA sequence 
encoding the Rev protein is inserted anywhere into the nef DNA 
sequence encoding amino acids 150-179 of the Nef protein. 
In another aspect, the invention provides a composition comprising 
the above-defined Tat protein or expression vector in combination 
with a carrier and optionally an adjuvant, especially at least one 
Thl adjuvant. Such composition is use for in vitro and in vivo 
administration both as an anti-HIV vaccine as well as for the 
purpose of studying HIV infection. 

The present invention also relates to a method of eliciting a 
humoral and cellular immune response in a mammal comprising 
administering the above-defined composition to the mammal. According 
to a specific embodiment, the composition comprising the Tat protein 
of the invention is administered simultaneously or sequentially with 
the composition comprising the expression vector of the invention. 
The foregoing merely summarizes certain aspects of the invention and 
is not intended, nor should it be constructed, as limiting the 
invention in any manner. Additional details of the invention are 
provided below. All patents, patent applications, and other 
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publications recited in this specification are hereby incorporated 
by reference in their entirety. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 displays the results of immunosuppressive activity of 
5 various Tat measured in vitro by a lymphoprolif eration assay. 

Figure 2 displays anti-Tatni B IgG ELISA titers of guinea pigs 
immunized with various Tat proteins. 

Figure 3 displays the results of the transactivation assay. 
Figure 4 gives the plasmid map of pET8cTat. 

1 0 DETAILED DESCRIPTION OF THE INVENTION 

In a first aspect, the invention thus provides a Tat protein 
comprising a mutated cysteine-rich domain wherein each cysteine 
residue of the cysteine-rich domain has been replaced with another 
amino acid, preferably a conservative amino acid, most preferably a 
15 serine. 

As used herein a "Tat protein" means any naturally occurring Tat 
protein obtained from any HIV-1, HIV-2 or SIV strain, including 
laboratory and primary isolates. The Tat protein is obtained 
preferably from a HIV-1 strain and more particularly from a HIV-1 

20 IIIB strain. Two kinds of Tat proteins have been disclosed in the 
literature i.e., Tat proteins having a short sequence of 86 amino 
acids and Tat proteins having a longer sequence of up to 99 to 102 
amino acids. This difference in size has been attributed to the 
variable length of the second exon encoding the protein. These two 

25 types of proteins fall under the scope of the invention. The amino 
acid sequences of a large number of Tat proteins are known and 
available, e.g., "Human Retroviruses and AIDS 1999: A Compilation 
and Analysis of Nucleic Acid and Amino Acid Sequences," Kuiken et 
al . , Eds., Theoretical Biology and Biophysics Group, Los Alamos 

30 National Laboratory, Los Alamos, NM, and http : //hiv-web . lanl . gov/ , 
and any of these can be used in the present invention. The Tat 
protein is composed of various conserved functional domains, and 
comprises particularly a highly conserved cysteine-rich domain. This 
definition also encompasses the said Tat proteins in which mutations 

35 have been introduced with the proviso that the said proteins contain 
a mutated cysteine-rich domain as defined below and remain devoid of 
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any transactivating and immunosuppressive activity and further 
remain capable of inducing neutralizing antibodies and a cellular 
immune response. The Tat protein of the invention is preferably Tat 
IIIB and corresponds most preferably to SEQ ID No 1. 

5 As used herein, the "mutated cysteine-rich domain" is the sequence 
corresponding to amino acids 22 to 37 of the Tat protein wherein 
each cysteine residue at positions 22, 25, 27, 30, 31, 34 and 37 
have been independently replaced with another amino acid, 
corresponding preferably to a conservative substitution and most 

) preferably to a serine residue. This definition intends also to 
include cysteine-rich domains in which in addition to the above- 
mentioned mutations, additional conservative substitution (s) have 
been introduced in positions different from positions 22, 25, 27, 
30, 31, 34 and 37. Taking as a reference the cysteine-rich domain of 

> Tat IIIB , this definirion includes all cysteine-domains having a 
similarity with IIIB cysteine-rich domain of at least 50%, 
preferably of at least 75%, most preferably of 100%. 

A "conservative amino acid substitution" is a substitution of a 
native amino acid residue with a nonnative residue such that there 

D is little or no effect on the polarity or charge of the amino acid 
residue at that position. A "conservative amino acid substitution" 
also encompasses non-naturally occurring amino acid residues that 
are typically incorporated by chemical peptide synthesis rather than 
by synthesis in biological systems. These include peptidomimetics, 

5 and other reversed or inverted forms of amino acid moieties. 

Naturally occurring residues may be divided into classes based on 
common side chain properties: 

1) hydrophobic: norleucine, Met, Ala, Val, Leu, lie; 

2) neutral hydrophilic: Cys, Ser, Thr; 
0 3) acidic: Asp, Glu; 

4) basic: Asn, Gin, His, Lys, Arg; 

5) residues that influence chain orientation: Gly, Pro; and 

6) aromatic: Trp, Tyr, Phe. 

For example, non-conservative substitutions may involve the exchange 
5 of a member of one of these classes for a member from another class. 
In making such changes, the hydropathic index of amino acids may be 
considered. Each amino acid has been assigned a hydropathic index on 
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the basis of its hydrophobicity and charge characteristics. The 
hydropathic indices are: isoleucine (+4.5); valine (+4.2); leucine 
(+3.8); phenylalanine (+2.8); cysteine/cystine (+2.5); methionine 
(+1.9); alanine (+1.8); glycine (-0.4); threonine (-0.7); serine (- 
5 0.8); tryptophan (-0.9); tyrosine (-1.3); proline (-1.6); histidine 
(-3.2); glutamate (-3.5); glutamine (-3.5); aspartate (-3.5); 
asparagine (-3.5); lysine (-3.9); and arginine (-4.5). 
The importance of the hydropathic amino acid index in conferring 
interactive biological function on a protein is generally understood 
10 in the art (Kyte et al., 1982, J. Mol . Biol. 157:105-31). It is 
known that certain amino acids may be substituted for other amino 
acids having a similar hydropathic index or score and still retain a 
similar biological activity. In making changes based upon the 
hydropathic index, the substitution of amino acids whose hydropathic 
15 indices are within +2 is preferred, those which are within ±1 are 
particularly preferred, and those within ±0.5 are even more 
particularly preferred. 

It is also understood in the art that the substitution of like amino 
acids can be made effectively on the basis of hydrophilicity, 

20 particularly where the biologically functionally equivalent protein 
or peptide thereby created is intended for use in immunological 
embodiments, as in the present case. The greatest local average 
hydrophilicity of a protein, as governed by the hydrophilicity of 
its adjacent amino acids, correlates with its immunogenicity and 

25 antigenicity, i.e., with a biological property of the protein. 

The following hydrophilicity values have been assigned to these 
amino acid residues: arginine (+3.0); lysine (+3.0); aspartate (+3.0 
± 1); glutamate (+3.0 ± 1); serine (+0.3); asparagine (+0.2); 
glutamine (+0.2); glycine (0); threonine (-0.4); proline (-0.5 ± 1); 

30 alanine (-0.5); histidine (-0.5); cysteine (-1.0); methionine (- 
1.3); valine (-1.5); leucine (-1.8); isoleucine (-1.8); tyrosine (- 
2.3); phenylalanine (-2.5); and tryptophan (-3.4). In making changes 
based upon similar hydrophilicity values, the substitution of amino 
acids whose hydrophilicity values are within +2 is preferred, those 

35 which are within ±1 are particularly preferred, and those within 
±0.5 are even more particularly preferred. One may also identify 
epitopes from primary amino acid sequences on the basis of 
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hydrophilicity. These regions are also referred to as "epitopic core 
regions . " 

Desired amino acid substitutions (whether conservative or non- 
conservative) can be determined by those skilled in the art at the 
5 time such substitutions are desired. 

The term "similarity" refers to a measure of relatedness that 
includes both identical matches and conservative substitution 
matches between two sequences as determined by a particular 
mathematical model or computer program (i.e., "algorithms") by 
inserting gaps, if required, in one or both sequences. A suitable 
programs available for public use is FAST A. If two polypeptide 
sequences have 10 of 20 identical amino acids, for example, and the 
remainder are all non-conservative substitutions, then the percent 
identity and similarity would both be 50%. If in the same example 
there are five positions in which there are conservative 
substitutions (in addition to the 10 identical residues), then the 
percent identity remains 50%, but the percent similarity would be 
75% (15/20) . 

In a preferred embodiment of the Tat protein of the invention, amino 
acid residues at positions 22, 25, 27, 30, 31, 34, and 37 are serine 
residues (herein called Tat7C/S) . According to a preferred 
embodiment, Tat7C/S corresponds to Tat IIIB 7C/S. 

In another embodiment, the Tat protein of the present invention is 
further modified by chemically methods such as those disclosed in 
U.S. 6,200,575. 

Amino acid numbering used herein is based on the sequence of the 
HIV-1 viral strain III B. The Tat protein of this strain is 
(SEQ ID NO 1) : 

MEPVDPRLEPWKHPGSQPKTACTNCYCKKCCFHCQVCFITKALGISYGRKKRRQRRRPPQGSQTHQVS 
LSKQPTSQSRGDPTGPKE 

Whenever a number of an amino acid residue or sequence is used in 
reference to a sequence other than from the IIIB strain, that number 
refers to the residue or sequence that corresponds to the numbered 
residue or sequence in the IIIB Tat. 

The Tat proteins of the invention can be made routinely using 
methods known in the art. The proteins can be synthesized or, 
preferably, expressed from a vector in a suitable expression system. 
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Vectors and expression of the encoded Tat protein of the invention 
is described fully below. When the Tat protein is produced by 
chemical synthesis, it is possible either to produce it in the form 
of one sequence or in the form of several sequences that are 
5 subsequently linked together in the correct order. The chemical 
synthesis may be carried out on solid phase or in solution, these 
two technologies being well known to the person skilled in the art 
and are described for example by the following authors: Atherton and 
Shepart "solid phase peptide synthesis" (IRL press Oxford, 1989); 

10 Houbenweyl "Method der organischen chemie" editor E. Wunsch vol 15-1 
and II, Stuttgart 1974; Dawon PE and al "Synthesis of proteins by 
native chemical ligation" Science, 1994, 266 (5186): 776-9; 
Kochendoerfer GG and al "Chemical protein synthesis" Curr. Opin. 
Chem. Biol., 1999, 3 (6): 665-71; and Dawson PE and al "Synthesis of 

15 native proteins by chemical ligation" Annu. Rev. Biochem. 2000, 69: 
923-60. The protein thus produced may be easily isolated and 
purified by methods well known in the art. 

The protein of the invention may also be produced by recombinant 
technologies well known in the art. These methods are described in 

20 details in the last edition of "Molecular Cloning: A Molecular 
Manual" by Sambrook et al., Cold Spring Harbor, supra. In such a 
case, the DNA sequence encoding the Tat protein of the invention is 
first produced by directed mutagenesis starting from the wild-type 
DNA sequence encoding Tat. Such a step may be carried out by PCR 

25 using primers containing the DNA sequence encoding the mutation (s) 
to be introduced. The mutated DNA sequence is then inserted into an 
appropriate expression vector. The thus obtained recombinant vector 
is then used to transform appropriate host cells to express the 
mutated Tat protein. The protein thus produced is isolated and 

30 purified using methods well known in the art. A process of 
expression and purification of the protein according to the 
invention is described in details in the attached examples. The 
process of the invention leads advantageously to a highly purified 
monomeric Tat protein which does not form any aggregates. 

35 Concerning the "expression vector," any expression vector 
classically used for the expression of recombinant proteins can be 
used to produce the Tat protein of the invention. "Expression 
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vectors" thus encompass live expression vectors such as viruses and 
bacteria as well as plasmids. Vectors in which the expression of the 
Tat DNA sequence is controlled by an inducible or non-inducible 
strong promoter are advantageously used. Expression vectors may 
5 include a selection marker such as, for example, an antibiotic 
resistance gene (such as Kanamycin) or dihydrof olate reductase gene. 
Non-limitative examples of expression vectors that can be used in 
the process of production of the Tat protein of the invention 
include: pET28 (Novagen) , pBAD (Invitrogen) plasmids; viral vectors 

10 such as baculovirus, adenovirus, adeno-associated virus (AAV), 
poxvirus (including avian pox, fowl pox, and preferably the 
attenuated vaccinia vector NYVAC (U.S. 5,364,773) or MVA (modified 
vaccinia virus Ankara, Swiss Patent No.: 568,392 and U.S. 
5,185,146), and the attenuated canarypox vector ALVAC (U.S. 

15 5,756,103; U.S. 5,990,091), poliovirus, alphavirus, VSV, herpes and 
retroviral vectors, as well as bacterial vectors such as salmonella, 
shigella and BCG. 

To obtain the expression of the Tat protein, any host cell 
classically used in association with the above-mentioned vectors can 
20 be used in the present invention. Non limitative examples of such 
host cells include cells from E. coli such as BL21 (A.DE3) , HB101, 
Topp 10, CAG 1139, cells from bacillus, and eukaryotic cells such as 
Vero, BHK, MRC5, MDCK, FERC-6, and CHO cells. 

The expression system preferably used in the present invention 
25 corresponds to the pM1815/£. coli cells. 

In another aspect, the invention relates to the nucleic acid 
i sequences encoding the above-defined Tat protein of the invention. 

The nucleotide sequences of a large number of tat genes are known 

and available, e.g., on the web site: http : / /hiv-web . lanl . gov/ . 
30 Nucleic acid numbering used herein is based on the following tat DNA 

sequence from HIV-1 viral strain III B (Seq. ID. No.: 2): 

atggagccag tagatcctag actagagccc tggaagcatc caggaagtca gcctaaaact 
gcttgtacca attgctattg taaaaagtgt tgctttcatt gccaagtttg tttcataaca 
35 aaagccttag gcatctccta tggcaggaag aagcggagac agcgacgaag acctcctcaa 
ggcagtcaga ctcatcaagt ttctctatca aagcaaccca cctcccaatc ccgaggggac 
ccgacaggcc cgaaggaata gaagaagaag gtggagagag agacagagac agatccattc 
gattagtgaa 
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"'The bold/underline codon indicates a stop codon at position 259 
(with an X in the corresponding position in the amino acid sequence) 
in the IIIB Tat, which, accordingly is 86 amino acids long. A number 
of naturally occurring Tat sequences have a Glu or Ser codon in 
5 place of this stop codon and have an additional 14 or more amino 
acid residues at the carboxy terminal end. 

Whenever a number of a nucleic acid residue is used in reference to 
a sequence other from the IIIB strain, that number refers to the 
residue that corresponds to the numbered residue in the SEQ ID No 2 
10 sequence. 

When a first nucleic or amino acid residue or sequence within a 
first polynucleotide or polypeptide (respectively) aligns with a 
second nucleic or amino acid residue or sequence within a second 
polynucleotide or polypeptide (respectively) when the two 

15 polynucleotides or polypeptides are brought into alignment using any 
art recognized alignment algorithm, e.g., SIM (Xiaoquin et al., 
Advances in Applied Mathematics 12, 337 (1991)), the first nucleic 
or amino acid residue or sequence within a first polynucleotide or 
polypeptide (respectively) are said to "correspond" one to the 

20 other. 

The codons of the nucleic acids of the invention can be 
advantageously optimized to improve the expression level, the 
selection of the optimized codons depending on the selected host 
cells . 

25 In a third aspect, the invention comprises an expression vector 
encoding the nucleic acid of the second aspect of the invention. 
Expression vectors into which the nucleic acids of the second aspect 
of the invention may be inserted are well known in the art and can 
be routinely selected by those of ordinary skill in the art based 

30 primarily on the host system into which the vector is to be 
inserted. Methods for inserting the nucleic acids of the second 
aspect of the invention into vectors are well known and routinely 
applied. E.g., Sambrook et al., "Molecular Cloning: A Laboratory 
Manual" vols. 1-3 (3 rd Edition, Cold Spring Harbor Laboratory Press, 

35 Cold Spring Harbor, New York 2001) . 

Expression vectors that can be employed in this aspect of the 
invention have been described in detail in the section regarding the 
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process of production of the Tat protein. The expression vectors of 
the present invention can be used either for the production of the 
Tat protein or directly as an active vaccine component of a 
composition of the invention. When the expression vector is used as 
5 a vaccine component, the expression vector to be used does not 
comprise any selection marker and corresponds to a viral vector such 
as adenovirus, poxvirus (including fowl pox, avian pox, and 
preferably the attenuated vaccinia vector NY VAC (US 5,364,773) or 
MVA (modified vaccinia virus Ankara, Swiss Patent No.: 568,392 and 
10 US 5,185,146), and the attenuated canarypox vector ALVAC 
(US 5,756,103; US 5990091), poliovirus, alphavirus, VSV, herpes 
retroviral vector, or a bacterial vector such as salmonella, 
shigella or BCG, or a plasmid DNA vectors including layer DNA 
vectors . 

15 In one embodiment of this aspect of the invention, the nucleic acid 
encoding the modified Tat polypeptide of the invention is the only 
HIV/SIV immunogen encoded by the vector. 

In a preferred embodiment, the vector according to this aspect of 
the invention further comprises nucleic acid sequences encoding the 

20 Rev and Nef HIV-1 proteins. Numerous wild-type rev and nef nucleic 
acid sequences are known. Figures 9-11 and 15-17 display many of 
them, and we contemplate that any of those displayed as well as 
consensus sequences of any two or more of these sequences can be 
used in the invention. In this embodiment, the vector of the 

25 invention comprises a nucleic acid sequence according to the second 
aspect of the invention and both a rev and nef sequence, and the 
vector express the mutated Tat protein of the invention and Rev and 
Nef proteins in the intended host. Preferably, in this embodiment 
the rev DNA sequence is inserted into the nef DNA sequence. 

30 Preferably, the rev DNA sequence is inserted anywhere into the 
region coding for amino acids 150-179 of Nef, thus producing an 
inactivated Nef protein without altering the CTL epitopes of the 
protein. 

As used here "Nef and Rev proteins" means any naturally occurring 
35 Rev and Nef proteins obtained form any HIV-1, HIV-2 or SIV strain 
including laboratory and primary isolates. The Rev and Nef proteins 
are obtained preferably from a HIV-1 strain. The DNA sequences 
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inserted in the expression vector of the invention comprises 
preferably the consensus sequences of the DNA sequences encoding Rev 
and Nef or DNA fragments thereof coding for CTL epitopes. The amino 
acid and nucleotide sequences of Rev and Nef as well as the CTL 
5 epitopes thereof so far identified can be download from the web 
site: http://hiv-web.lanl.gov/. 

In a preferred embodiment, the expression vector expresses the DNA 
sequence encoding the Tat protein under one promoter and the DNA 
sequences encoding Rev and Nef under another promoter. This 
10 construct advantageously produces an immunologically active Tat 
protein capable of being secreted by mammalian cells, taken up by 
mammalian cells, is presented as antigen and is recognized by immune 
cells and/or specific antibody. 

The modified Tat proteins of the invention have several uses. They 
15 can be used alone or as a component of a prophylactic or therapeutic 
vaccine, where its inability to transactivate HIV gene expression 
and induce immunosuppression in a host while retaining its 
immunogenicity and capacity to produce neutralizing antibodies and 
cellular immune response make it both safe and effective for HIV 
20 infection prophylaxis and treatment. 

The transactivating and immunosuppressive activities of the Tat 
protein can be easily determined by the CAT assay and the 
immunosuppression assay, respectively, as described in the attached 
examples. The induction of neutralizing antibodies can be easily 
25 demonstrated by the neutralization assay as described in the 
attached examples. 

The present invention thus also provides compositions, especially 
vaccines, comprising a Tat protein and/or an expression vector as 
defined above in combination with a suitable carrier. 

30 The nature of the carrier will vary depending on the intended 
application. For example, for in vitro assays, the carrier can be a 
simple buffer solution. For prophylactic or therapeutic purposes, 
the carrier can be any pharmaceutically acceptable carrier, many of 
which are known in the art. A pharmaceutically acceptable carrier 

35 will also be desirable for uses in vivo other than treatment or 
prophylaxis, e.g., raising anti-Tat antibodies for use in assays or 
treatment. 
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Methods of making pharmaceutical compositions are well known and can 
be routinely used to make pharmaceutical compositions according to 
the fourth and fifth aspects of the invention. E.g., "Remington: The 
Science and Practice of Pharmacy," by Alfonso R. Gennaro (20th 
5 edition, Lippincott, Williams & Wilkins, Philadelphia, PA, 2000). 

According to one embodiment, the composition comprises Tat7C/S in 
combination with a pharmaceutical^ acceptable carrier. Such a 
composition may be stored in lyophilized form and reconstituted in 
an injectable solution before injection. 
10 The composition of the invention may include one adjuvant such as a 
Thl adjuvant (e.g., CpG sequences or MPL and MPL analogs), or a Th2 
adjuvant (e.g., alum, emulsions, minerals) or a combination adjuvant 
including at least one Thl adjuvant. 

As part of a vaccine the Tat protein of the invention can also be 

15 used in a lipidated form comprising a lipidic part covalently linked 
to the Tat protein. Lipidic parts appropriate to form such lipidated 
Tat as well as a process of preparation of the same can be found 
e.g., in US5993823. The lipidated Tat protein comprises preferably a 
N-s-lysylpalmytoyl residue linked at the COOH terminal function of 

20 the Tat protein. 

As part of a vaccine, it can be the sole immunogen or one of 
several. The Tat protein can be used as the sole immunogen of 
therapeutic anti-HIV vaccine. Preferably the protein of the 
invention is used in combination with an expression vector 

25 expressing the Tat protein of the invention in combination with Rev 
and Nef in order to produce an anti-HIV prophylactic or therapeutic 
vaccine. Tat, Rev and Nef are HIV proteins expressed early during 
the infection cycle, before production of infectious virions. These 
proteins are processed and CTL epitopes are expressed in the context 

30 of HLA class I antigen on the surface of HIV-infected cells. The 
advantage of immunizing humans against these three proteins 
altogether is to induce cytotoxic T cells capable of killing HIV 
infected cells before virions can be produced thus eradicating 
infected cells and preventing HIV replication and spreading. Also, 

35 one of the functions of the viral Nef protein is to down-regulate 
MHC class-I molecule expression on the cell surface and thereby 
confer resistance to immune recognition by CD8 cells. Once the 

SUBSTITUTE SHEET (RULE 26) 



WO 03/054006 PCT/EP02/14841 

- 16 - 

structural proteins are made, there is presumably sufficient Nef 
already present to confer resistance to cytotoxic T cells. The Nef 
used in this embodiment as a vaccine, therefore, should be devoid of 
this activity. 

5 Furthermore, the protein and the expression vector of the invention 
may also be combined with other subunits HIV immunogens or vectors 
encoding the same such as Env, Gag, Pol, Vpr, Vpu and Vif. 
Advantageously, the Tat protein of the invention may be combined 
with the ALVAC constructions, especially ALVAC 1452 and 1433 as 

10 disclosed in US 5990091. 

Such vaccines can be prepared by standard methods well known to 
those of ordinary skill in the art with standard vaccine 
pharmaceutical carriers and, preferably, with an adjuvant. 
In a sixth aspect, the invention provides a method of eliciting a 

15 humoral and cellular immune response in a mammal, comprising 
administering to a subject (preferably human) one or more 
compositions according to the fourth and/or fifth aspect of the 
invention to elicit humoral and cellular immune responses. 
"Cellular immune response" means induction of a specific CD4 T cell 

20 response optionally in association with a specific CD8 T cell 
response and an innate immune response . 

CD4 T cell responses can be monitored upon in vitro recall of 
peripheral or splenic mononuclear cells with the antigen used to 
immunized animals. Lymphoproliferative responses as well as cytokine 
25 inductions (Thl/Th2 balance) can be measured (for a review see MK 
Jenkins, Annu rev Immunol. 2001, 19, 23-45). 

CD8 T cell responses can be evaluated (ex vivo or upon re- 
stimulation of mononuclear cells) either using 1) a standard 
Chromium release assay which directly measures antigen specific 

30 lytic activity (P. Brossard et al., Blood, 90, 1594-1599) or using 
IFNyELISPOT or ICC (intracellular cytokine) assays that both measure 
the ability of CD8 cells to be stimulated by a 9mer peptide specific 
for the antigen versus an irrelevant 9mer peptide (Carvalho LH et 
al., J. Immunol. Methods 2001; 252, 207-18) for IFNyELISPOT and (C 

35 King et al., Nature Medicine, 7, 206-212) for ICC. 

Innate immune responses can be monitored by measuring the leves of 
pro-inflammatory (IL-6, TNFa) and/or anti-viral (type I interferons) 
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cytokines in the serum of immunized animals or upon in vitro antigen 
specific re-stimulations. The early stimulation of innate immunity 
can also be evaluated by assessing the ex vivo activation status of 
antigen presenting cells (monocytes, dendritic cells) and NK cells 
5 that are derived from recently immunized animals (L Krishnan et al., 
J. Immunol. 2001, 166, 1885-1893). 

According to a preferred embodiment, a composition of the invention 
comprising a Tat protein is administered simultaneously or 
sequentially, preferably co-administered, with a composition 
10 comprising an expression vector of the invention, preferably an 
expression vector expressing in addition to the Tat protein of the 
invention the Rev and Nef proteins. 

Suitable amounts of protein for vaccine and other in vivo 
applications are 10 to 500, preferably 20 to 200 ]ig per dose. 
15 Suitable amounts of viral expression vectors are in the range of 10 4 
to 10 11 pfu, and suitable amounts of plasmid expression vectors is 
0.1 to 5 mg per dose. 

Administration according to this aspect of the invention can thus be 
of a protein composition according to the fourth aspect of the 

20 invention, a vector according to the fifth aspect of the invention, 
or both, either simultaneously or sequentially. Furthermore, 
administration may comprise compositions of more than one protein, 
or expression vector. For example, one or a combination of 
composition comprising DNA plasmid plus a viral vector or two 

25 vectors expressing the same genes can be administered (e.g., DNA 
plasmid- tat/re v/nef + Pox- tat /rev/ nef or Alphavirus- tat/re v/nef + 
Pox- tat/re v/nef or) . In an alternative embodiment, administration 
according to this aspect of the invention can be a combination of 
vectors carrying different genes (e.g., vector- tat/re v/nef + vector- 

30 gag/pol/env) . In each instance, the number of injections is 
preferably 2 to 5 for each vector. Furthermore, the number of 
injections is also preferably of 2 to 5 for the composition 
comprising the Tat protein of the invention. 

Administration of the composition of the invention can be carried 
35 out by intradermal, mucosal route or preferably by intramuscular 
injection. 
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The method of this aspect of the invention is useful for 
prophylactic and therapeutic treatment of HIV infection. The method 
is also useful to raise anti-Tat antibodies in a healthy mammal or a 
mammal infected with HIV without further harming the mammal. The 
5 antibodies thereby raised can be harvested and used for treatment, 
for assays, and for the study of the molecular and systemic effects 
of anti-Tat antibodies on HIV infection. 

The Tat protein of the invention can be used to raise anti-wild-type 
Tat antibodies in mammalian systems susceptible to AIDS without 
10 otherwise compromising the health of the mammal. Such antibodies can 
be used to further study the immune response to HIV, in HIV assays, 
as well as to treat HIV infection. 

The Tat protein of the invention can be used to produced monoclonal 
antibodies by methods well known in the art directed against 
15 specific epitopes of the protein. These antibodies could be used for 
passive Immunotherapy of HIV-infected individual in combination with 
chemotherapy and or therapeutic vaccination. 

The said monoclonal antibodies can be used in ELISA assays. They are 
particularly useful as a prognostic tool to detect Tat antigenemia 

20 in course of HIV-inf ection inasmuch as the serum concentration of 
Tat is correlated with the number of HIV-infected cells. 
Furthermore, the Tat protein of the invention can be used in ELISA 
assays to detect anti-Tat antibodies present in the serum of treated 
or non treated HIV-infected patients since high level of anti-Tat 

25 antibodies correlates with non progression to disease as 
demonstrated by Zagury et al. (J. of Human Virology, 1998, 1, 282- 
292) . In such a case the protein of the invention is coated on an 
ELISA plate, contacted with serial dilutions of the patient serum to 
be tested, and then contacted with a enzyme-linked anti-human 

30 antibody. The anti-human antibody /anti-Tat antibody complex thus 
formed is then detected by colorimetric detection. The Tat protein 
of the invention can be advantageously us as a negative control in 
any assay aiming to evaluate the transactivating and/or 
immunosuppressive activity of a Tat protein. 

35 The tat/rev/ nef expression vector of the invention can be used in 
ELISPOT assays to measure cellular responses in seropositive 
individuals as well as vaccinated individuals immunized with a 
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different vector. Indeed, Tat and Rev responses have been shown to 
correlate with long-term non-progression. Carel A. Van Baalen et 
al., J. of General. Virology 78, 1913-1918 (1997). 

Another use of the mutated Tat protein of the invention is as a 
5 research tool to study the immune response to HIV Tat during HIV 
infection. The mutated Tat protein of the invention enables 
scientists to observe the immune response to Tat in a model in vivo 
system without the presence of the complicating molecular processes 
of HIV gene expression and Tat induced immunosuppression. 
10 The following examples further illustrate the invention and are not 
intended, nor should they be construed as limiting the invention in 
any manner. Those skilled in the art will appreciate that variations 
of the Examples provided below can be made in accordance with the 
teachings herein and knowledge common to those skilled in the art 
15 without varying from the scope or spirit of the present invention. 

EXAMPLES 
Example 1 

Construction of plasmid pET8cTat7C/S 
The construction of this clone involved two steps: 

20 I the directed mutagenesis of the WT-tat gene to obtain the 
triple-mutant clone : Cys 30, Cys 31, Cys 34 -» Ser 30, Ser 31, 
Ser 34. 

II the directed mutagenesis of the tat-triple-mutant gene to obtain 
the pET8cTat7C/S plasmid. 

25 Mutagenesis and cloning of the triple mutant of Tat 

We used the recombinant PCR technique to mutate the WT-tat IIIB 
gene. The template was the clone pET8cTat (containing Seq. ID. No.: 
2). The map of this plasmid is given in Figure 4 and its entire DNA 
sequence is given in SEQ ID No 10. The recombinant PCR technique 

30 requires two PCR steps. 

In the first step, two PCR reactions lead to the amplification and 
the mutagenesis of two overlapping fragments: the "5' fragment" and 
the "3' fragment" of the tat gene. 

In the second round, the two overlapping fragments are mixed 
35 together along with 5' and 3' primers to amplify the whole mutated 
tat gene. In the strategy outlined below, nucleotide positions in 
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the PCR primers corresponding to targeted alterations are 
underlined. 

The protocol used was: 

1. First round of PCR : amplification and mutagenesis of the 5' 
5 fragment using the following primers : PBAMU(5' 

CGCGGATCCATGGAGCCAGTAGATCCTA-3 ' ) (SEQ ID No 3) and R8 (5' 
GTTATGAAACAAACTTGGGAATGAAAGGAAGACTTT-3 ' ) (SEQ ID No 4) and 

amplification and mutagenesis of the 3' fragment using the 
following primers: PHINDR (5' CCCCAAGCTTCACTAATCGAATGGATCT-3 ' ) 
10 (SEQ ID No 5) and U8 ( 5 ' -AAAGTCTTCCTTTCATTCCCAAGTTTGTTTCATAAC-3 ' ) 

(SEQ ID No 6) 

2. Purification of these two PCR products using a preparative 2-5 % 
agarose gel and a Qiagen gel extraction Kit (Qiagen, Valencia, 
CA) 

15 3. Second round of PCR : amplification of the whole mutated gene 
using both fragments from the first round of PCR and the two 
primers : PBAMU (SEQ ID No 3) and PHINDR (SEQ ID No5) 

4 . Purification of the 327 bp triple-mutated Tat-gene using a 
preparative 2.5 % agarose gel and a Qiagen gel extraction Kit 

20 (Qiagen, Valencia) 

5. Digestion of these DNA fragment by Hind III and Bam HI and 
purification of the fragment using a Qiagen PCR Extraction Kit. 

6. Ligation of the digested fragment into pET8c vector previously 
digested with Bam HI and Hind III, transformation of XL 10 

25 competent bacteria (Invitrogen, Carlsbad) with the ligation mix 

and mini-preparation of plasmids from cultures grown from the 
transf ormants using Qiagen Mini-Prep kit (Qiagen, Valencia) 

7. Restriction analysis of the clones obtained and DNA sequencing. 

Mutagenesis and cloning of the 7-serine mutant of tat 
30 The recombinant PCR technique was used with the triple mutant clone 
(obtained in the previous step) as template. 

However, we needed 3 PCR steps to successfully amplify and mutate 
the whole gene. We were unsuccessful initially in trying to perform 
the recombinant PCR step with the initial length of overlap, so we 
35 extended the 5' PCR products to increase the length of overlap 
between the two PCR products to be recombined in the final step. By 
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combining the extended mutated 5' fragment with the 3' fragment in a 

third round of PCR using the 5' and 3' terminal primers, we were 

able to generate the full length 7C/S fragment. 

The protocol used was: 
5 1. First round of PCR : amplification and mutagenesis of the 5' 
fragment using the following primers PBAMU (5' 
CGCGGATCCATGGAGCCAGTAGATCCTA-3 * ) (SEQ ID No 3) and R9 (5' 
AAAGGAAGACTTTTTAGAATAGGAATTGGTAGAAGCAGTTTT-3 * ) (SEQ ID No 7) and 
amplification and mutagenesis of the 3' fragment using the 
10 following primers : PHINDR (5' CCCCAAG-CTTCACTAATCGAATGGATCT-3 ' ) 

(SEQ ID No 5) and U10 (5'- TAAAAAGTCTTCCTTTCATTCCCAAGTTT- 
CTTTCATAACAAA-3 ' ) (SEQ ID No 8) 

2. Purification of these two PCR products using a preparative 2.5 % 
agarose gel and a Qiagen gel extraction Kit (Qiagen, Valencia) 
15 3 . These two fragments failed to generate the full length Tat 
fragment in a secondary PCR reaction. Therefore we extended the 
5' fragment to increase the region of overlap. This step # 3 
enabled the extension of the 5' fragment by PCR using the 
primers PBAMU (5* -CGCGGATCCATGGAGCCAGTAGATCCTA-3' ) (SEQ ID No 3) 
20 and Rll (5'- GAAAGAAACTTG-GGAATGAAAGGAAGACTTTTTAGAATAGG-3 ' ) 

(SEQ ID No 9) 

4. Purification of this extended fragment using a preparative 2.5 % 
agarose gel and a Qiagen gel extraction Kit (Qiagen, Valencia) 

5 . Third round of PCR amplification of the whole mutated gene using 
25 both fragments from the first round (fragments 3' , step # 1) and 

second round of PCR (extended fragment 5', step #3) and the two 
primers : PBAMU (SEQ ID No 3) and PHINDR (SEQ ID No 5) 

6. Purification of the 327 bp 7-ser-mutant-Tat-gene using a 
preparative 2.5 % agarose gel and a Qiagen gel extraction Kit 

30 (Qiagen, Valencia) 

7. Digestion of these DNA fragment by Hind III and Bam HI and 
purification of the fragment using a Qiagen PCR Extraction Kit. 

8. Ligation of the digested fragment into pET8c previously digested 
with Bam HI and Hind III, transformation of XL 10 competent 

35 bacteria (Invitrogen, Carlsbad) with the ligation mix and mini- 

preparation of plasmids using Qiagen Mini-Prep kit (Qiagen, 
Valencia) 
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9. Restriction analysis of the clones obtained and DNA sequencing 
for confirmation of the desired construct . 

Example 2 

Construction of plasmid pM1815 
5 The Tat7C/S gene was inserted in the plasmid pET8cTat7C/S of 
example 1 between the BamHl and Hindlll sites. Since the ATG start 
site was immediately downstream of the Bam HI site (ggatccATGg) in 
the pET8cTat7C/S, this created an Ncol site (CCATGG) at the 
translation initiation codon. This Ncol site permitted direct 
10 insertion without modification of the reading frame in the pM1800 
plasmid. This gene was therefore reinserted in this plasmid between 
the 5 'Ncol and 3'HindIII sites. 

The plasmid pM1800 is constructed starting from pET28 (Novagen) . 
pET28c was amplified by PCR using two primers flanking either side 

15 of the region corresponding to the origin fl. The product thus 
amplified corresponds comprises the whole sequence of the vector 
with the exception of the region comprising origin fl. The two 
restriction sites Asc I and Pac I are introduced via the two primers 
used in the PCR reaction. In parallel the cer fragment is amplified 

20 using two primers which lead to a cer fragment inserted between Asc 
I and Pac I sites. The vector and the cer fragment thus amplified 
are digested by the Asc I and Pac I enzymes and then ligated 
together. 

The vector pM1800 thus obtained comprises an expression cassette 
25 under the control of the bacteriophage T7 promoter, a polylinker for 
cloning the genes of interest downstream from the promoter, a 
transcription terminator also derived from bacteriophage T7, the cer 
fragment downstream the polylinker and a kanamycin resistance gene. 
The DNA sequence of plasmid pM1800 is SEQ ID No 10. 
30 The XL 1-Blue strain (Stratagene, La Jolla, CA) was transformed with 
pET8cTat7C/S. Two clones were transplanted and the ADN of the 
plasmid was extracted and digested with Ncol and Hindlll (GIBCO-BRL) 
restriction enzymes in buffers suggested by the manufacturer. The 
Tat7C/S DNA sequence (approximately 300 bp) was then isolated on 2% 
35 agarose gel by electroelution. 
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At the same time, the pM1800 plasmid was also digested by Ncol and 
Hind III and isolated on 1% agarose gel by electroelution. 
The digested Tat fragment and pM1800 plasmid were then subjected to 
ligation with the T4 ligase (GIBCO-BRL) , under the conditions 
5 described by the manufacturer. The ligation product was used to 
transform the E. coli DH10B strain by electroporation, with the 
clones being selected in the presence of kanamycin. 

The plasmid thus produced containing the Tat7C/S DNA sequence is 
named pM1815. 

10 Example 3 

Fermentation, bacterial cell lysis and protein purification 
A seed vial of pM1815 is used to inoculate, a pre-culture of E. coli 
BL21 (XDE3) (in Erlenmeyer flask containing the LB2X medium. After 
15h to 18h agitation at 37°C, the whole content of the flask is 

15 added to 20 L of GluSKYE4 medium (yeast extract, salts and glucose) 
in a 30L B. BraUn fermenter. When the initial growth phase reaches 
cell density up to A600 of 30 ± 5, the synthesis of the Tat protein 
IIIB 7C/S is induced by the addition of an inducer (IPTG ImM final) . 
The culture is still maintained for 3 hours under agitation at 37 °C 

20 and then the medium is chilled down to 10°C before cell harvesting. 
The cells are collected by centrifugation and stored at < -35°C. 
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Thawing of bacterial paste 
(15 g) 


•Cellular paste thawed for 1 night 
at 5 i 3°C 






Suspension and 
homogeni za ti on 


•In a buffer of 50 mM Tris-HCl, 

0.2M NaCl, benzonase* 5 Ul/ml, pH 

8.0 in an ice bath at 5 to 10°C 

using an Ultraturax 

* Addition of benzonase 

extemporaneously 






Cell lysis 


•High-pressure cracking using a 
Panda microf luidizer at 16000 psi 
or 1100 bars 

•Centrif ugation at 20000 g at 5 ± 
3°C, for 2 hours 
•Removal of supernatant and its 
clarification by filtration 
(0.8/0.2 Dm) 


I 




Ammonium sulfate 
precipitation 


•Addition of ammonium sulfate to 
1.5 M concentration 

•Magnetic agitation, for 1 hour at 
room temperature 
•1 hour of rest 

•Centrifugation at lOOOOg at 20 ± 
3°C 

•Re-suspension of ammonium sulfate 
(AS) precipitate in a 50 mM Tris- 
HCl, 8M Urea, 50 mM NaCl, pH8.0 
buffer = AS solution. 
•Filtration through 0.2 ]im membrane 






SP Sepharose FF 
Chromatography 
volume 20 ml 
1.5 cm 
Flow rate 2 ml/min 


•column Equilibrated in a buffer 
of 50 mM Tris-HCl, 50 mM NaCl, 8M 
urea 

•Injection of the filtered AS 
solution followed by rinsing with 
pH balance-restoring buffer 
solution 

•Removal of the flow through 
•Elution with increasing ionic 
strength 

50 mM Tris-HCl, 0.3M NaCl, 8M 

urea, pH 8.0 
su mM Tris— nti, u . dm Nac±, om 
urea, pH 8.0 

50 mM Tris-HCl, 1.5M NaCl, 8M 
urea, pH 8.0 

•Tat7C/S eluted in the NaCl 0.6 M 
eluate 
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The purified Tat protein is stored at -20 °C. The buffer of the Tat 
protein thus purified is preferably replaced with an urea-free 
buffer such as 50mM Tris-HCl pH 7.5. Furthermore, the Tat protein 
needs to be sterilized before injection. This step can be easily 
5 done by sterilizing filtration on 0.2 Dm membrane. The Tat IIIB 7C/S 
thus isolated is greater than 95% pure, as determined by 
densitometric analysis on a blue coomasie-stained SDS-PAGE gel. 
Furthermore, the protein thus purified is substantially exempt of 
any multimeric forms. Indeed, and contrary to the preparation of the 

10 Tat protein of the prior art, the protein thus produced is a 
monomeric protein containing less than 1% of multimeric Tat forms. 
Furthermore, the protein of the invention can be purified at a pH 
near neutrality without forming aggregates. Furthermore, it appears 
that the expression level of the protein of the invention is higher 

15 than the expression level of the corresponding wild-type protein. 
Indeed Wild-type Tat represents 5% of the total soluble proteins 
whereas Tat7C/S represents at least 15% of the total soluble 
proteins . 

Example 4 

20 Neutralization and Neutralization Assays 
Transactivation Assay 

The transactivation assay was developed from G.Tosi et al., Eur. J. 
Immunol. 30, 1120-1126 (2000) and M. Rusnati et al . J. of Biological 
Chemistry 272, 11313-11320 (1997), allowing the biological activity 

25 of the Tat molecule to be determined in vitro. Stably transfected 
HeLa-3Tl cells are carrying a plasmid with the LTR sequences of the 
HIV virus. These LTR sequences function as a promoter for the gene 
of the chloramphenicol acetyl transferase (CAT) which is a reporter. 
The addition of Tat to the culture medium causes the synthesis of 

30 CAT, which can be measured with a commercial ELISA test 
(Boehringer) . The results were standardized in relation to the 
cellular protein concentration. 

Figure 3 is showing the transactivating activity of the native Tat, 
Tat toxoid and Tat7C/S. 

35 Neutralization Assay 
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The incubation of serial dilutions of sera with 40ng/ml of purified 
native Tat prior to transactivation assay, allows to check for 
neutralization of transactivation activity by comparison with 
adequate controls. 

5 Neutralizing titers are expressed as reverse of the last dilution 
able to reduce 90% (llog) of the transactivation signal. 
The following table shows specific antibody titer and neutralizing 
titer: 
Table 1 



Sample tested 


Neutrali zing 
titer 


Specific antibody 
titer (log) 


Cob #075-5 (Tat 
Toxoid) 


<or = 5 


3.79 


Cob #075-33 (Tat7C/S) 


5 


3.45 


Cob#074 (native Tat) 


5 


3.2 


Cob#045 (positive 
control) (hyper-immune 
anti-Tat serum (CFA) ) 


800 


6.3 



10 

These results clearly indicate that Tat7C/S induce antibodies which 
neutralize Tat transactivation activity. The neutralizing titer is 
equivalent to titer obtained with Tat toxoid. 

Moreover, this experiment confirm that the neutralization test is 
15 very sensitive since a neutralizing activity is measured even with 
low titer sera. 

Example 5 

Immunosuppression Assay 

The immunosuppressive activity of Tat was measured in vitro by a 
20 lymphoprolif eration assay. Lymphoproliferation was measured by 
tritiated thymidine incorporation (3H-thymidine) in peripheral blood 
mononuclear cells (PBMCs) after stimulation by a recall antigen 
(previously described in Zagury et al., Proc. Natl. Acad. Sci. U S 
A. 1998; 95:3851-6) . 
25 This assay consisted of isolating, on a ficoll gradient, PBMCs from 
the peripheral blood of a healthy subject and cultivating them in a 
microwell in the presence of recall antigen and declining doses of 
Tat protein in an HL1 culture medium supplemented with 5xl0-5M B- 
mercaptoethanol and 10% AB serum. Each dose of Tat was tested in 
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triplicate. 18 hours before the cessation of the culture, 0.5 mCi of 
tritiated thymidine was added to each microwell. The cells were then 
washed and the incorporated radioactivity was measured with a fluid 
scintillation counter. The results were measured in cpm. 
5 The goal of this test was to characterize the immunosuppressive 
properties of a genetic mutant of Tat. The PBMCs were incubated with 
5 ug of native Tat IIIB, detoxified or Tat7C/S), stimulated by the 
antigens PPD/TT (PPD at 1000 units/ml and TT at 1000 Lf/ml) over a 
period of 5 days. 

10 Detoxified Tat is produced by inactivation of Tat IIIB by an 
alkylation reaction of Tat IIIB (Seq. ID. No.: 1) using 
iodoacetamide in the following conditions: added micromoles of 
iodoacetamide =200 X number of micromoles of Tat + number of 
micromoles of DTT. 

15 The results are presented in Figure 1 as % of immunosuppression, 
calculated as follows: 

0/ . (cpm in cells not treated with Tat) - (cpm in cells treated with Tat) 

%immunosupp ression = v v - 

100 

The data represent 3 experiments performed independently on 3 
different donors . The results show that under conditions where 
20 native Tat inhibits the proliferation of PBMCs by 40%, the mutant of 
Tat7C/S shows no immunosuppressive activity. 
Example 6 

Immunogenicity of the mutant TatHIB 7C/S in the guinea pig 

Five female guinea pigs (Dunkin-Hartley albinos) were injected two 

25 times, at two week intervals, intramuscularly (in the quadriceps) 
with 50 ug of the TatHIB 7C/S. A control group of five guinea pigs 
received, in a similar manner, 50 ug of chemically detoxified 
TatHIB protein (termed "TatHIB toxoid" prepared according to the 
process described in example 5). 

30 The antibody level induced against the native TatHIB protein were 
evaluated by EL ISA before and after each immunizations (Days 1, 14, 
and 29, respectively). The results are displays in figure 2. 
The IgG antibody titers (expressed in loglO) are represented in the 
table 2. The antibody titers of the samples were calculated by 

35 linear regression of a standard an anti-TatHIB hyperimmune serum 
from guinea pig. The titer of this standard serum was first set as 
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the reciprocal of its dilution, giving an optical density at 45 0 - 
650 nm of 1.0 (average titer calculated at the end of several 
independent titrations). Limit of detection set at 0.7 loglO. 
The TatHIB 7C/S was shown to be capable of inducing specific 
5 antibodies against the native TatHIB protein in this guinea pig 
model, with the levels induced after 2 immunizations being very 
close to those evoked by the TatHIB toxoid protein. 



Table 2 

10 



Immunogen 


Guinea 
pig it 


Native anti-TatHIB IgG antibody titers 

(loglO) 


Day 1 


Day 14 


Day 29 


TatHIB 
7C/S 


1 


0 


0.000 


3.327 


2 


0 


0.000 


3.112 


3 


0 


1.711 


3.458 


4 


0 


0.000 


2.834 


5 


0 


0.000 


3.034 


mean 


0 


0.342 


3. 153 


std 
deviation 


0 


0.765 


0.245 


Toxoid 
TatlllB 


6 


0 


0.000 


3.243 


7 


0 


1.327 


2.967 


8 


0 


1.522 


3.384 


9 


0 


1.138 


Dead 


10 


0 


2.321 


3.796 


mean 


0 


1.262 


3.348 


std 
deviation 


0 


0.837 


0.346 
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We claim: 

1. A Tat protein comprising a mutated cysteine-rich domain wherein 
5 all the cysteine residues of the cysteine-rich domain have been 

replaced independently with another amino acid. 

2. The Tat protein according to claim 1, wherein each cysteine 
residue of the cysteine-rich domain is a conservative 

10 substitution. 

3. The Tat protein according to claim 1, wherein each cysteine 
residue of the cysteine-rich domain is a serine. 

15 4. A nucleic acid encoding the Tat protein according to any one of 
claims 1 to 3 . 

5. An expression vector comprising a nucleic acid according to 
claim 4. 

20 

6. The expression vector of claim 5 further comprising a DNA 
sequence encoding Nef and Rev proteins . 

7. The expression vector of claim 7 wherein the DNA sequence 
25 encoding the Rev protein is inserted anywhere into the Nef DNA 

sequence encoding amino acids 150-179 of the Nef protein. 

8. A composition comprising the Tat protein according to any one of 
claims 1 to 3, carrier and optionally and adjuvant. 

30 

9. A composition comprising the expression vector according to any 
one of claims 5 to 7, a carrier and optionally an adjuvant. 

10. The composition of claim 8 or 9 comprising at least one Thl 
35 adjuvant. 
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11. A method of eliciting a humoral and cellular immune response in 
a mammal comprising administering a composition according to any 
one of claims 8 to 10 to the mammal. 

12. The method according to claim 11 wherein the composition of 
claim 8 and the composition of claim 9 are administered 
simultaneously or sequentially. 
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Figure 1 
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Figure 2(a) - Anti-native Tat Hi B IgG ELISA titers in guines pigs : D14 
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Figure 2(b) - Anti-native Tat III B IgG ELISA titers in guines pigs : D29 
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<110> Aventis Pasteur S.A. 
Rappaport, Jay 
Klein, Michel 
Zagury, Jean Francois 

<120> Mutated HIV TAT 

<130> TP019 

<160> 11 

<170> Patentln version 3.0 

<210> 1 
<211> 86 
<212> PRT 

<213> Human immunodeficiency virus type 1 
<400> 1 

Met Glu Pro Val Asp Pro Arg Leu Glu Pro Trp Lys His Pro Gly Ser 
15 10 15 

Gin Pro Lys Thr Ala Cys Thr Asn Cys Tyr Cys Lys Lys Cys Cys Phe 
20 25 ~ 30 

His Cys Gin Val Cys Phe lie Thr Lys Ala Leu Gly He Ser Tyr Gly 
35 4 0 45 

Arg Lys Lys Arg Arg Gin Arg Arg Arg Pro Pro Gin Gly Ser Gin Thr 
50 55 60 

His Gin Val Ser Leu Ser Lys Gin Pro Thr Ser Gin Ser Arg Gly Asp 
65 70 75 80 
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Pro Thr Gly Pro Lys Glu 
85 

<210> 2 
<211> 310 
<212> DNA 

<213> Human immunodeficiency virus type 1 



<400> 2 

atggagccag tagatcctag actagagccc tggaagcatc caggaagtca gcctaaaact 60 

gcttgtacca attgctattg taaaaagcgt tgctttcatt gccaagtttg tttcataaca 120 

aaagccttag gcatctccta tggcaggaag aagcggagac agcgacgaag acctcctcaa 180 

ggcagtcaga ctcatcaagt ttctctacca aagcaaccca cctcccaatc ccgaggggac 240 

ccgacaggcc cgaaggaata gaagaagaag gtggagagag agacagagac agatccattc 300 

gattagtgaa 310 

<210> 3 

<211> 28 

<212> DNA 

<213> Artificial 



<220> 

<223> PCR primer; PBAMU 

<400> 3 

cgcggatcca tggagccagt agatccta 28 

<210> 4 

<211> 36 

<212> DNA 

<213> Artificial 



<220> 
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<223> PCR primer; R8 
<400> 4 

gttatgaaac aaacttggga atgaaaggaa gacttt 

<210> 5 

<211> 28 

<212> DNA 

<213> Artificial 

<220> 

<223> PCR primer; PHINDR 
<400> 5 

ccccaagctt cactaatcga atggatcc 

<210> 6 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> PCR primer; U8 
<400> 6 

aaagtcttcc tttcattccc aagtttgctt cataac 

<210> 7 

<211> 42 

<212> DNA 

<213> Artificial 

<220> 

<223> PCR primer; R9 
<400> 7 
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4 

aaaggaagac tttttagaat aggaattggt agaagcagtt tt 

<210> 8 

<211> 42 

<212> DNA 

<213> Artificial 

<220> 

<223> PCR primer; U10 
<400> 8 

taaaaagtct tcctttcatt cccaagtctc tttcataaca aa 42 

<210> 9 

<211> 41 

<212> DNA 

<213> Artificial 

<220> 

<223> PCR primer; Rll 
<400> 9 

gaaagaaact tgggaatgaa aggaagactt tttagaatag g 41 

<210> 10 

<211> 5315 

<212> DNA 

<213> Artificial 

<220> 

<223> Plasmid pM1800 
<400> 10 

tggcgaatgc cttaattaag gcggggcaca actcaatttg cgggtactga ttaccgcagc 
aaagacctta ccccgaaaaa atccaggctg ctggctgaca cgatttctgc ggtttatctc 
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120 



WO 03/054006 PCT/EP02/14841 

5 



gatggctacg 


agggcagaca 


gtaagtggat 


ttaccataat 


cccttaattg 


tacgcaccgc 


180 


taaaacgcgt 


tcagcgcgat 


cacggcagca 


gacaggtaaa 


aatggcaaca 


aaccacccga 


240 


aaaactgccg 


cgatcgcgcc 


tgataaattt 


taaccgtatg 


aatacctatg 


caaccagagg 


300 


gtacaggcca 


cattaccccc 


acttaatcca 


ctgaagctgc 


catttttcat 


ggtttcacca 


360 


tcccagcgaa 


gggccatcca 


gcgtgcgttc 


ctgtatttcc 


gactggcgcg 


ccattcaggt 


420 


ggcacttttc 


ggggaaatgt 


gcgcggaacc 


cctatttgtt 


tatttttcta 


aataca'ttca 


480 


aatatgtatc 


cgctcatgaa 


ttaattctta 


gaaaaactca 


tcgagcatca 


aatgaaactg 


540 


caatttattc 


atatcaggat 


tatcaatacc 


atatttttga 


aaaagccgtt 


tctgtaatga 


600 


aggagaaaac 


tcaccgaggc agttccacag 


gatggcaaga 


tcctggtatc 


ggtctgcgat 


660 


tccgactcgt 


ccaacatcaa 


tacaacccat 


taatttcccc 


tcgtcaaaaa 


taaggttatc 


720 


aagtgagaaa 


tcaccatgag 


tgacgacrga 


atccggtgag 


aatggcaaaa 


gtttatgcat 


780 


ttctttccag 


acttgttcaa 


caggccagcc 


attacgctcg 


tcatcaaaat 


cactcgcatc 


840 


aaccaaaccg 


ttattcattc 


gtgattgcgc 


ctgagcgaga 


cgaaatacgc 


gatcgctgtt 


900 


aaaaggacaa 


ttacaaacag 


gaatcgaatg 


caaccggcgc 


aggaacactg 


ccagcgcatc 


960 


aacaatattt 


tcacctgaat 


caggatactc 


ttctaatacc 


tggaatgctg 


ttttcccggg 


1020 


gatcgcagtg 


gtgagtaacc 


atgcatcatc 


aggagtacgg 


ataaaatgct 


tgatggtcgg 


1080 


aagaggcata 


aattccgtca 


gccagttrag 


tctgaccatc 


tcatctgtaa 


catcattggc 


1140 


aacgctacct 


ttgccatgtt 


tcagaaacaa 


ctctggcgca 


tcgggcttcc 


catacaatcg 


1200 


atagattgtc 


gcacctgatt 


gcccgacatt 


atcgcgagcc 


catttatacc 


catataaatc 


1260 


agcatccatg 


ttggaattta 


atcgcggcct 


agagcaagac 


gtttcccgtt 


gaatatggct 


1320 


cataacaccc 


cttgtattac 


tgtttatgta 


agcagacagt 


tttattgttc 


atgaccaaaa 


1380 


tcccttaacg 


tgagttttcg 


ttccactgag 


cgtcagaccc 


cgtagaaaag 


atcaaaggat 


1440 


cttcttgaga 


tccttttttt 


ctgcgcg~aa 


tctgctgctt 


gcaaacaaaa 


aaaccaccgc 


1500 


taccagcggt 


ggtttgtttg 


ccggatcaag 


agctaccaac 


tctttttccg 


aaggtaactg 


1560 


gcttcagcag 


agcgcagata 


ccaaatactg 


tccttctagt 


gtagccgtag 


ttaggccacc 


1620 


acttcaagaa 


ctctgtagca 


ccgcctacat 


acctcgctct 


gctaatcctg 


ttaccagtgg 


1680 


ctgctgccag 


tggcgataag 


tcgtgtccta 


ccgggttgga 


ctcaagacga 


tagtcaccgg 


1740 


ataaggcgca 


gcggtcgggc 


tgaacggggg 


gttcgtgcac 


acagcccagc 


ttggagcgaa 


1800 
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cgacctacac 


cgaactgaga 


tacctacagc 


gtgagctatg 


agaaagcgcc 


acgcttcccg 


1860 


aagggagaaa 


ggcggacagg 


tatccggtaa 


gcggcagggt 


cggaacagga 


gagcgcacga 


1920 


gggagcttcc 


agggggaaac 


gcctggtatc 


tttatagtcc 


tgtcgggttt 


cgccacctct 


1980 


gacttgagcg 


tcgatttttg 


tgatgctcgt 


caggggggcg 


gagcctatgg 


aaaaacgcca 


2040 


gcaacgcggc 


ctttttacgg 


ttcctggcct 


tttgctggcc 


ttttgctcac 


atgttctttc 


2100 


ctgcgttatc 


ccctgattct 


gtggataacc 


gtattaccgc 


ctttgagtga 


gctgataccg 


2160 


ctcgccgcag 


ccgaacgacc 


gagcgcagcg 


agtcagtgag 


cgaggaagcg 


gaagagcgcc 


2220 


tgatgcggta 


ttttctcctt 


acgcatctgt 


gcggtatttc 


acaccgcata 


tatggtgcac 


2280 


tctcagtaca 


atctgctctg 


atgccgcata 


gttaagccag 


tatacactcc 


gctatcgcta 


2340 


cgtgactggg 


tcatggctgc 


gccccgacac 


ccgccaacac 


ccgctgacgc 


gccctgacgg 


2400 


gcttgtctgc 


tcccggcatc 


cgcttacaga 


caagctgtga 


ccgtctccgg 


gagctgcatg 


2460 


tgtcagaggt 


tttcaccgtc 


atcaccgaaa 


cgcgcgaggc 


agctgcggta 


aagctcatca 


2520 


gcgtggtcgt 


gaagcgattc 


acagatgcct 


gcctgttcat 


ccgcgtccag 


ctcgi:tgagt 


2580 


ttctccagaa 


gcgttaatgt 


ctggcttctg 


ataaagcggg 


ccatgttaag 


ggcggttttt 


2640 


tcctgtttgg 


tcactgatgc 


ctccgtgcaa 


gggggatttc 


tgttcatggg 


ggtaatgata 


2700 


ccgatgaaac 


gagagaggat 


gctcacgata 


cgggttactg 


atgatgaaca 


tgcccggtta 


2760 


ctggaacgtt 


gtgagggtaa 


acaactggcg 


gtatggatgc 


ggcgggacca 


gagaaaaatc 


2820 


actcagggtc 


aatgccagcg 


cttcgttaat 


acagatgtag 


gtgttccaca 


gggtagccag 


2880 


cagcatcctg 


cgatgcagat 


ccggaacata 


atggtgcagg 


gcgctgactt 


ccgcgtttcc 


2940 


agactttacg 


aaacacggaa 


accgaagacc 


attcatgttg 


ttgctcaggt 


cgcagacgtt 


3000 


ttgcagcagc 


agtcgcttca 


cgttcgcrcg 


cgtatcggtg 


attcattctg 


ctaaccagta 


3060 


aggcaacccc 


gccagcctag 


ccgggtcctc 


aacgacagga 


gcacgatcat 


gcgcacccgt 


3120 


ggggccgcca 


tgccggcgat 


aatggccrgc 


ttctcgccga 


aacgtttggt 


ggcgggacca 


3180 


gtgacgaagg 


cttgagcgag 


ggcgtgcaag 


attccgaata 


ccgcaagcga 


caggccgatc 


3240 


atcgtcgcgc 


tccagcgaaa 


gcggtccccg 


ccgaaaatga 


cccagagcgc 


tgccggcacc 


3300 


tgtcctacga 


gttgcatgat 


aaagaagaca 


gtcataagtg 


cggcgacgat 


agtcatgccc 


3360 


cgcgcccacc 


ggaaggagct 


gactgggttg 


aaggctctca 


agggcatcgg 


tcgagatccc 


3420 


ggtgcctaat 


gagtgagcta 


acttacatta 


attgcgttgc 


gctcactgcc 


cgctttccag 


3480 


tcgggaaacc 


tgtcgtgcca 


gctgcatcaa 


tgaatcggcc 


aacgcgcggg 


gagaggcggt 


3540 
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ttgcgtattg 


ggcgccaggg 


tggtttttct 


tttcaccagt 


gagacgggca 


acagctgatt 


3600 


gcccttcacc 


gcctggccct 


gagagagttg 


cagcaagcgg 


tccacgctgg 


tttgccccag 


3660 


caggcgaaaa 


tcctgtttga 


tggtggttaa 


cggcgggata 


taacatgagc 


tgtcttcggt 


3720 


atcgtcgtat 


cccactaccg 


agatatccgc 


accaacgcgc 


agcccggact 


cggtaatggc 


3780 


gcgcattgcg 


cccagcgcca 


tctgatcgtt 


ggcaaccagc 


atcgcagtgg 


gaacgatgcc 


3840 


ctcattcagc 


atttgcatgg 


tttgttgaaa 


accggacatg 


gcactccagt 


cgccttcccg 


3900 


ttccgctatc 


ggctgaattt 


gattgcgagt 


gagatattta 


tgccagccag 


ccagacgcag 


3960 


acgcgccgag 


acagaactta 


atgggcccgc 


taacagcgcg 


atttgctggt 


gacccaatgc 


4020 


gaccagatgc 


tccacgccca 


gtcgcgtacc 


gtcttcatgg 


gagaaaataa 


tactgttgat 


4080 


gggtgtctgg 


tcagagacat 


caagaaazaa 


cgccggaaca 


ttagtgcagg 


cagcrtccac 


4140 


agcaatggca 


tcctggtcat 


ccagcggata 


gttaatgatc 


agcccactga 


cgcgttgcgc 


4200 


gagaagattg 


tgcaccgccg 


ctttacaggc 


ttcgacgccg 


cttcgttcta 


ccatcgacac 


4260 


caccacgctg 


gcacccagtt 


gatcggcgcg 


agatttaatc 


gccgcgacaa 


tttgcgacgg 


4320 


cgcgtgcagg 


gccagactgg 


aggtggcaac 


gccaatcagc 


aacgactgtt 


tgcccgccag 


4380 


ttgttgtgcc 


acgcggttgg 


gaatgtaatt 


cagctccgcc 


atcgccgctt 


ccactttttc 


4440 


ccgcgttttc 


gcagaaacgt 


ggctggcctg 


gttcaccacg 


cgggaaacgg 


tctgataaga 


4500 


gacaccggca 


tactctgcga 


catcgta-aa 


cgttactggt 


ttcacattca 


ccaccctgaa 


4560 


ttgactctct 


tccgggcgct 


atcatgccat 


accgcgaaag 


gttttgcgcc 


attcgatggt 


4620 


gtccgggatc 


tcgacgctct 


cccttatgcg 


actcctgcat 


taggaagcag 


cccagtagta 


4680 


ggttgaggcc 


gttgagcacc 


gccgccgcaa 


ggaatggtgc 


atgcaaggag 


atggcgccca 


4740 


acagtccccc 


ggccacgggg 


cctgccacca 


tacccacgcc 


gaaacaagcg 


ctcatgagcc 


4800 


cgaagtggcg 


agcccgatct 


tccccatcgg 


tgatgtcggc 


gatataggcg 


ccagcaaccg 


4860 


cacctgtggc 


gccggtgatg 


ccggccacga 


tgcgtccggc 


gtagaggatc 


gagacctcga 


4920 


tcccgcgaaa 


ttaatacgac 


tcactatagg 


ggaattgtga 


gcggataaca 


attcccctct 


4980 


agaaataatt 


ttgtttaact 


ttaagaagga 


gatataccat 


gggcagcagc 


catcatcatc 


5040 


atcatcacag 


cagcaqccta 


ataccacacc 

3 3 ^~\f 




y y ^- uay La 0 y 




jIUU 


agcaaatggg 


tcggatccga 


attcgagctc 


cgtcgacaag 


cttgcggccg 


cactcgagca 


5160 


ccaccaccac 


caccactgag 


atccggc^gc 


taacaaagcc 


cgaaaggaag 


ctgagttggc 


5220 
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tgctgccacc gctgagcaat aactagcata accccttggg gcctctaaac gggtcttgag 5280 
gggttttttg ctgaaaggag gaactatatc cggat 5315 

<210> 11 

<211> 4914 

<212> DNA 

<213> Artificial 



<220> 

<223> Plasmid pET8cTat 
<400> 11 



ttctcatgtt 


tgacagctta 


tcatcgaraa 


gctttaatgc 


ggtagtttat 


cacagttaaa 


60 


ttgctaacgc 


agtcaggcac 


cgtgtatgaa 


atctaacaat 


gcgctcatcg 


tcatcctcgg 


120 


caccgtcacc 


ctggatgctg 


taggcatagg 


cttggttatg 


ccggtactgc 


cgggcctctt 


180 


gcgggatatc 


gtccattccg 


acagcatcgc 


cagtcactat 


ggcgtgctgc 


tagcgctata 


240 


tgcgttgatg 


caatttctat 


gcgcacccgt 


tctcggagca 


ctgtccgacc 


gctttggccg 


300 


ccgcccagtc 


ctgctcgctt 


cgctact ugg 


agccactatc 


gactacgcga 


tcatggcgac 


360 


cacacccgtc 


ctgtggatat 


ccggatacag 


ttcctccttt 


cagcaaaaaa 


cccctcaaga 


420 


cccgtttaga 


ggccccaagg 


ggttatgcta 


gttattgctc 


agcggtggca 


gcagccaact 


480 


cagcttcctt 


tcgggctttg 


ttagcagccg 


gatccgttca 


ctaatcgaat 


ggatctgtct 


540 


ctgtctctct 


ctccaccttc 


ttcttctatt 


ccttcgggcc 


tgtcgggtcc 


cctcgggatt 


600 


gggaggtggg 


ttgctttgat 


agagaaactt 


gatgagtctg 


actgccttga 


ggaggtcttc 


660 


gtcgctgtct 


ccgcttcttc 


ctgccatagg 


agatgcctaa 


ggcttttgtt 


atgaaacaaa 


720 


cttggcaatg 


aaagcaacac 


tttttacaat 


agcaattggt 


acaagcagtt 


ttaggctgac 


780 


ttcctggatg 


cttccagggc 


tctagtc^ag 


gatctactgg 


ctccatggta 


tatctccttc 


840 


ttaaagttaa 


acaaaattat 


ttctagaggg 


aaaccgttgt 


ggtctcccta 


tagtgagtcg 


900 


tattaatttc 


gcgggatcga 


gatctcgatc 


ctctacgccg 


gacgcatcgt 


ggccggcatc 


960 


accggcgcca 


caggtgcggt 


tgctggcgcc 


tatatcgccg 


acatcaccga 


tggggaagat 


1020 


cgggctcgcc 


acttcgggct 


catgagcgct 


tgtttcggcg 


tgggtatggt 


ggcaggcccc 


1080 


gtggccgggg 


gactgttggg 


cgccatc~cc 


ttgcatgcac 


cattccttgc 


ggcggcggtg 


1140 
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ctcaacggcc 


tcaacctact 


actqqqccqc 

ZJ ZJ ZJ ZJ 


ttcctaatgc 


aggagtcgca 


taaqqqaqaq 


1200 


cgtcgaccga 


tgcccttgag 


agccttcaac 


ccagtcagct 


ccttccggtg 


qqcqcqqqqc 


1260 


atgactatcg 


tcgccgcact 


tatgactgtc 


ttctttatca 


tgcaactcgt 


aggacaggtg 


1320 


ccggcagcgc 


tctgggtcat 


tttcqqcqaq 


gaccgctttc 


qctqqacrcac 


gacgatgatc 


1380 


ggcctgtcgc 


ttgcggtatt 


cggaatcttg 


cacgccctcg 


ctcaagcctt 


cgtcactggt 


1440 


cccgccacca 


aacgtttcgg 


cgagaagcaq 


gccattatcg 


ccggcatggc 


ggccgacgcg 


1500 


ctgggctacg 


tcttgctggc 


gttcgcgacg 


cgaggctgga 


tggccttccc 


cattatgatt 


1560 


cttctcgctt 


ccggcggcat 


cgggatgccc 

zj z> ZD ZD 


qcqttqcaqq 

ZJ Zl Z3 :3 z) 


ccatgctgtc 


caggcaggta 


1620 


gatgacgacc 


atcagggaca 


gcttcaagga 


tcgctcqcgg 

ZJ ZD ZD ZD 


ctcttaccag 


cctaacttcg 


1680 


atcactggac 


cgctgatcgt 


cacggcgatt 


tatgccgcct 


cggcgagcac 


atcrcfaaccrcfcr 

*-* • y y w* \_*y y y 


1740 


ttggcatgga 

ZZ> z? ZJ Zj 


ttqtaqqcqc 


cgccctatac 


cttgtctgcc 


tccccacatt 


y i^y L.^.'ujL.Ljy l» 


1800 


gcatggagcc 


gggccacctc 


gacctgaatg 


qaaqccqqcq 

ZJ Zj Z^ ZJ _! 


gcacctcgct 


aacggattca 


1860 


ccactccaag 


aattggagcc 


aatcaatcct 


tgcggagaac 


tqtqaatqcq 


caaaccaacc 


1920 


cttggcagaa 


catatccatc 


gcgtccgcca 


tctccagcag 


ccqcacqcqq 


cgcacct egg 


1980 


gcagcgttgg 


gtcctggcca 


cgggtqcqca 

ZD ZD ^ ZJ Zy w " 


tgatcgtgct 


cctgtcgttg 


aggacccggc 


2040 


taggctggcg 


gggttgcctt 


actggttagc 


agaatgaatc 


accgatacgc 


era a caaaca t 


2100 


gaagcgactg 


ctgctgcaaa 


acgtctgcga 


cctgagcaac 


aacataaata 


rrfcr't"^" rrrrft* t" 


21 60 


tccgtgtttc 


gtaaagtctg 


gaaacqcqqa 

Zj ZD ZD ZD 


agtcagcgcc 


ctgcaccatt 


a t at izecacra 


2220 


tctgcatcgc 


aggatgctgc 


tggctaccct 


gtggaacacc 


tacatctgta 


ttaacgaagc 


2280 


gctggcattg 


accctgagtg 


atttttctct 


qgtcccqccq 

ZJ ZJ ^ ^ . 


catccatacc 


gccagttgtt 


2340 


taccctcaca 


acgttccagt 


aaccgggcat 


gttcatcatc 


agtaacccgt 


atca" aaara 


2400 


tcctctctcg 


tttcatcggt 


atcattaccc 


ccatgaacag 


aaatccccct 


t" a ca nnnfl rrn 

v«-u vy y Q y y 


2460 


catcagtgac 


caaacaggaa 


aaaaccgccc 


ttaacatggc 


ccgctttatc 


agaagecaga 


2520 


cattaacgct 


tctggagaaa 


ctcaacgagc 


tggacqcqqa 

» ZD ZD ZJ ZD ^ 


tgaacaggca 


gaca t;ctgtg 


2580 


aatcgcttca 


cgaccacgct 


gatgagcctt 


accgcagctg 


cctcgcgcgt 


ttcqgtqata 


2640 




ccucugacac 


atgcagcccc 


cggagacggt 


cacagcttgt 


ctgtaagcgg 


2700 


atgccgggag 


cagacaagcc 


cgtcagggcg 


cgtcagcggg 


tgttggcggg 


tgtcggggcg 


2760 


cagccatgac 


ccagtcacgt 


agcgatagcg 


gagtgtatac 


tggcttaact 


atgeggcate 


2820 
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agagcagatt 


gtactgagag 


tgcaccatat 


atgcggtgtg 


aaataccgca 


cagatgcgta 


2880 


aggagaaaat 


accgcatcag 


gcgctcttcc 


gcttcctcgc 


tcactgactc 


gctgcgctcg 


2940 


gtcgttcggc 


tgcggcgagc 


ggtatcagct 


cactcaaagg 


cggtaatacg 


gttatccaca 


3000 


gaatcagggg 


ataacgcagg 


aaagaacatg 


tgagcaaaag 


gccagcaaaa 


ggccaggaac 


3060 


cgtaaaaagg 


ccgcgttgct 


ggcgtttttc 


cataggctcc 


gcccccctga 


cgagcatcac 


3120 


aaaaatcgac 


gctcaagtca 


gaggtggcga 


aacccgacag 


gactataaag 


ataccaggcg 


3180 


tttccccctg 


gaagctccct 


cgtgcgcrct 


cctgttccga 


ccctgccgct 


taccggatac 


3240 


ctgtccgcct 


ttctcccttc 


gggaagcgtg 


gcgctttctc 


atagctcacg 


ctgtaggtat 


3300 


ctcagttcgg 


tgtaggtcgt 


tcgctccaag 


ctgggctgtg 


tgcacgaacc 


ccccgttcag 


3360 


cccgaccgct 


gcgccttatc 


cggtaactat 


cgtcttgagt 


ccaacccggt 


aagacacgac 


3420 


ttatcgccac 


tggcagcagc 


cactggtaac 


aggattagca 


gagcgaggta 


tgtaggcggt 


3480 


gctacagagt 


tcttgaagtg 


gtggcctaac 


tacggctaca 


ctagaaggac 


agtatttggt 


3540 


atctgcgctc 


tgctgaagcc 


agttaccttc 


ggaaaaagag 


ttggtagctc 


ttgatccggc 


3600 


aaacaaacca 


ccgctggtag 


cggtggtttt 


tttgtttgca 


agcagcagat 


tacgcgcaga 


3660 


aaaaaaggat 


ctcaagaaga 


tcctttgatc 


ttttctacgg 


ggtctgacgc 


tcagtggaac 


3720 


gaaaactcac 


gttaagggat 


tttggtcatg 


agattatcaa 


aaaggatctt 


cacctagatc 


3780 


cttttaaatt 


aaaaatgaag 


ttttaaatca 


atctaaagta 


tatatgagta 


aacttggtct 


3840 


gacagttacc 


aatgcttaat 


cagtgaggca 


cctatctcag 


cgatctgtct 


atttcgttca 


3900 


tccatagttg 


cctgactccc 


cgtcgtgtag 


ataactacga 


tacgggaggg 


cttaccatct 


3960 


ggccccagtg 


ctgcaatgat 


accgcgagac 


ccacgctcac 


cggctccaga 


tttaccagca 


4020 


ataaaccagc 


cagccggaag 


ggccgagcgc 


agaagtggtc 


ctgcaacttt 


atccgcctcc 


4080 


atccagtcta 


ttaattgttg 


ccgggaagct 


agagtaagta 


gttcgccagt 


taatagtttg 


4140 


cgcaacgttg 


ttgccattgc 


tgcaggcatc 


gtggtgtcac 


gctcgtcgtt 


tggtatggct 


4200 


tcattcagct 


ccggttccca 


acgatcaagg 


cgagttacat 


gatcccccat 


gttgtgcaaa 


4260 


aaagcggtta 


gctccttcgg 


tcctccgatc 


gttgtcagaa 


gtaagttggc 


cgcagtgtta 


4320 


tcactcatgg 


ttatggcagc 


actgcataat 


tctcttactg 


tcatgccatc 


cgtaagatgc 


4380 


ttttctgtga 


ctggtgagta 


ctcaaccaag 


tcattctgag 


aatagtgtat 


gcggcgaccg 


4440 


agttgctctt 


gcccggcgtc 


aacacgggat 


aataccgcgc 


cacatagcag 


aactttaaaa 


4500 


gtgctcatca 


ttggaaaacg 


ttcttcgggg 


cgaaaactct 


caaggatctt 


accgctgttg 


4560 
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